Remote control of electronic devices

ABSTRACT

A controlling device (e.g., a telephony device) can remotely control various tasks associated with a controlled device (e.g., a personal computer), including the navigation of user interfaces associated with applications or an operating system associated with the controlled device. A task can be controlled at the controlled device by mapping user input received at the controlling device to control commands suitable for execution at the controlled device.

RELATED APPLICATION

The subject matter of this patent application is related to U.S. patentapplication Ser. No. 10/956,720, filed Oct. 1, 2004, entitled “SpokenInterfaces,” which patent application is incorporated by referenceherein in its entirety.

TECHNICAL FIELD

The following disclosure generally relates to remote control ofelectronic devices.

BACKGROUND

Most computer systems include input devices for receiving user input(e.g., mouse, keyboard, etc.). Typically, input devices are local to thecomputer system and are connected to the computer system by wired orwireless connections. A user can use an input device to navigate agraphical user interface with a cursor or other pointing device, launchapplications and interact with the launched applications.

Users often desire to interact with home or office computer systems fromremote locations. If the user has access to a remote computer systemwith a display screen, then the user can control the home or officecomputer over a network (e.g., Internet, Ethernet, etc.) using aterminal emulation program (e.g., Telnet) or other publicly availableremote control application. These publicly available remote controlapplications display an image of the user's desktop, which allows theuser to remotely navigate and interact with their home or officecomputer system. For some users, however, a remote computer systemhaving a display device may not be available.

SUMMARY

The deficiencies described above or overcome by the disclosedimplementations of systems, method and devices for remotely controllingelectronic devices.

A first device (“the controlling device”) can remotely control varioustasks associated with a second device (“the controlled device”),including the navigation of user interfaces associated with applicationsor an operating system residing on the second device. A task can becontrolled at the controlled device by mapping user input received atthe controlling device to control commands suitable for controlling thetask at the controlled device. Some examples of tasks include navigatinguser interfaces and/or file systems, executing and controlling functionsor features associated with applications or an operating system,collecting information (e.g., system configuration information, stateinformation, etc.).

In some implementations, the controlling device can be a telephonydevice (e.g., smart phone, etc.). The controlling device can establishcommunication with the controlled device through a network access device(e.g., a modem). After communication is established, the user canremotely control the controlled device using, for example, the keypad ofthe phone. If the controlling device is a touchtone phone, then thecontrolled device can detect Dual-Tone Multi-Frequency (DTMF) digits(also referred to as “tones”) that are transmitted by the controllingdevice. The tones can be mapped to control commands or operating systemevents for controlling the controlled device.

In some implementations, a cursor can be navigated around a userinterface associated with a controlled device using a controllingdevice. As the cursor traverses the user interface the contents offiles, documents, web pages, mail messages, word processing files,links, controls and the like are converted into audible descriptions.The audible descriptions are transmitted back to the controlling deviceand provide audible feedback to a user of the controlling device, sothat the user can navigate and control the user interface without adisplay screen. This feature provides an advantage over conventionaltelephony systems where, for example, keys or tones are tied orhardwired to specific commands which can not be changed.

In some implementations, a method of remotely controlling an electronicdevice includes: establishing communication between a controlling deviceand a controlled device; receiving input from the controlling device;mapping the input to control commands; and executing the controlcommands at the controlled device.

In some implementations, a method includes: determining a first spatialposition in relation to a plurality of objects on a user interface;receiving through a network an input from a device; determining a secondspatial position on the user interface in response to the input; andoutputting an audio segment related to the second spatial position.

In some implementations, a method includes: determining a first state ofa device to be controlled; receiving through a network an input from aremote device; determining a second state of the device; and outputtingan audio segment related to the second state.

In some implementations, a method includes: receiving an input from acontrolling device; using the input to control a user interfaceassociated with a controlled device, the user interface having aplurality of objects, wherein controlling the user interface, includes:directionally stepping through the plurality of objects in the userinterface to select an object of interest; activating the object ofinterest in the user interface; and producing an output associated withthe activation of the object of interest in the user interface.

DESCRIPTION OF DRAWINGS

FIG. 1 is a flow diagram of an exemplary process for remote control ofelectronic devices.

FIG. 2 illustrates an exemplary keypad of a controlling device.

FIG. 3 is a block diagram of an exemplary software architecture for acontrolled device.

FIG. 4 is a block diagram of an exemplary hardware architecture for acontrolled device.

DETAILED DESCRIPTION Remote Control Process

FIG. 1 is a flow diagram of an exemplary process 100 for remote controlof electronic devices. The steps of process 100 do not have to occur inany specific order, and at least some steps can be performsimultaneously in a multithreading and/or multiprocessing environment.For the purposes of this example, the device will be described in thecontext of a personal computer (PC) (“the controlled device”) that iscontrolled by a touchtone phone having a numeric keypad (the“controlling device”).

It should be apparent that other devices and system configurations arepossible. For example, a controlled device can be a computer system, acomputing device, a personal computer, a client computer in aclient-server topology, a laptop computer, a portable electronic device,a telephone (landline or wireless), a television receiver (including ahigh definition television receiver), a set-top box, a personal digitalassistant (PDA), a portable media player, an embedded electronic deviceor appliance, a game console, or any other electronic device. Acontrolling device can be a POTS telephone, a mobile phone, a smartphone, and any other device capable of transmitting control informationto an electronic device over a communication medium.

The term “controlling” includes controlling tasks associated with thecontrolled device. Tasks can include but are not limited to navigatinguser interfaces and file systems (including navigating user interfacesassociated with applications and the operating system), invoking orlaunching applications or programs, functions and/or features associatedwith the controlled device or its peripherals, scrolling through lists,menus or browsers, text entry, collecting information (e.g., meta data,system configuration data), and any other activity that can be locallyperformed at the controlled device. For example, a user can remotelycontrol the movements of a cursor in a desktop user interface to selecta software application (e.g., email) from a menu, desktop icon, or otherselection mechanism, launch the selected application, then navigate orinteract with a user interface associated with the launched applicationto perform various tasks.

The process 100 begins when a communication channel is establishedbetween the controlling device and the controlled device (102). Acommunication channel can be established by the controlling devicecalling a telephone number associated with the controlled device. Thetelephone call can be placed using one or more communication networks,including but not limited to: the Public Switched TelecommunicationsNetwork (PSTN), the Internet (e.g., VoIP), wireless networks, etc. Thecontrolled device can include a telephony interface (e.g., dial-upmodem, cable modem, digital subscriber line (DSL), network interfacecard (NIC), etc.) for receiving the call and for establishingcommunication with the controlling device.

After communication is established, the user can use the keypad or otherinput mechanism of the controlling device to enter input, which isreceived by the controlled device (104). The input can be of the form ofkey stroke(s) or voice input from the user, which is converted by thecontrolling device into signals or packets, which can be carried ortransmitted over a communication network using one or more knowncommunication protocols (e.g., PCM, TCP/IP, GSM, etc.). In someimplementations, the controlling device is a touchtone phone and sendsDTMF tones to the controlled device. The controlled device can detectand interpret the tones using a suitable protocol for such purpose, suchas the publicly available Telephony Application Program Interface(TAPI).

In some implementations, the controlled device maps the detected signalsinto control commands or instructions that can be used to control andinteract with the controlled device (106). For example, the user canpress a key or key combination on the keypad to control one or moretasks, as described with respect to FIG. 2. The maps can be generatedand stored at the controlled device or downloaded from a network deviceor other source. A “map” translates input signals received from thecontrolling device (e.g., DTMF tones) and maps the signals into commandsor instructions that can be executed by a remote control application.The remote control application can be part of an operation system or anapplication. For example, a client application can be configured toreceive telephony information from a telephony server running on thecontrolled device, as described with respect to FIG. 3. An example of anapplication that can be remotely controlled is described in U.S. patentapplication Ser. No. 10/956,720, filed Oct. 1, 2004, entitled “SpokenInterfaces.”

In some implementations, mapping is performed by the controlling device.For example, the user can download maps into their mobile phone usingknown push technologies (PointCast™, Intermind™, etc.). The maps canthen be used to map the keys of a conventional keypad to a set ofcommands or operating system events that are suitable for controllingone or more tasks at the controlled device. There can be a different mapfor different tasks allowing for a wider variety of tasks to beperformed using the same keypad or keyboard. The maps can be generatedby software publishers or service providers, or can be generated byusers at the controlling device or through a web site. For example, auser can create a map using a tool hosted on a network. The map can thenbe downloaded into the phone in response to a key combination entered bythe user or some other trigger event. In some implementations, theappropriate maps can be sent with the input command signals to thecontrolled device, and the maps can be applied at the controlled device.In some implementations, a description of the appropriate maps can besent with the input command signals to the controlled device and themaps can be constructed and applied at the controlled device.

After the input is mapped to control command, the commands can beexecuted by one or more client applications residing on or associatedwith the controlled device (108), as described with respect to FIG. 3.In some implementations, the controlled device can provide feedback tothe controlling device (110). If the user is using the controllingdevice to navigate in a user interface associated with the controlleddevice, then the controlled device can send speech audio signals back tothe user, which can inform the user of the location of the cursor. Forexample, as the user navigates the cursor over various objects in theuser interface (e.g., buttons, links, files, images, application launchicons, etc.), a speech generator can generate files (e.g., .wav files)containing speech describing objects and/or their spatial positions onthe user interface as the objects are traversed by a cursor (e.g., “menubar”, “enter”, “file”, “email”, “Word”, etc.). The speech files can beconverted into signals (e.g., PCM signals) that are suitable fortransmission back to the controlling device where they can bereconstructed and played for the user through an audio system ordisplayed as text or images if the controlling device includes a displayscreen. Many types of feedback are possible, including video, stillimages, graphics, text messages and the like. In some implementations, afirst spatial position is determined in relation to a plurality ofobjects on a user interface associated with a controlled device. Thecontrolled device receives an input from a controlling device throughcommunication medium. A second spatial position is determined on theuser interface in response to the input. An audio segment related to thesecond spatial position is transmitted back to the controlling device.

Example Keypad Operation

FIG. 2 illustrates an exemplary keypad 212 of a controlling device 210(e.g., a telephony device). In some implementations, the keypad 212includes keys 215 for numbers 0 through 9, an asterisk key 235 and apound key 237. One or more of the keys 215 can be mapped to remotecontrol functions associated with tasks to be performed at thecontrolled device. In this example, the remote control functionsassociated with the keys 215 are shown in parentheses. In a normal modeof operation, the keys 215 can be used to perform typical functionsassociated with the controlling device (e.g., place a telephone call).In a remote control mode, the keys 215 perform remote control functions,as determined by the mapping then in effect on the controlled orcontrolling device. For example, when the user wants to use thecontrolling device 210 to remotely control another device, the user canselect a key combination (or dedicated function key, if available) toplace the controlling device 210 into the remote control mode. There canbe a different key combination for each mapping available in thecontrolled or controlling device. A user can invoke the appropriate mapfor the application or function that the user desires to control (e.g.,browser, email, operating system utilities, etc.) For example, whenremotely controlling an email application the user may want to map oneof the keys 115 to a “send” command and another to a “reply” command,which are commands often associated with email applications.

In the implementation shown in FIG. 2, the mapping provides for thenavigation of a user interface associated with the controlled device.For this particular mapping, the keys 2,4,6,8 are directional keys forcontrolling navigation in an upward, leftward, rightward, or downwarddirection, respectively, in a two-dimensional user interface. Bycontrast, the keys 0,1,3,5,7,9,*,# are functional keys.

Other mappings of the keypad 212 are possible. For example, the keypad212 can include one or more dedicated function keys for remotelycontrolling particular tasks at the controlled device. For example, thekeypad 212 could include dedicated keys for controlling email functions,such as send, reply, delete, etc.

The directional keys 2,4,6,8, can be used to control the movements of acursor in a user interface associated with the controlled device. Thiscan be a default mode of operation for the directional keys. Other modesof operation are possible. For example, one or more functional keys canbe manipulated to place the device in a different operational mode,where particular ones of the directional keys are mapped to differentnavigation or interaction events. The functional keys can be manipulatedto operate in a default mode, with each functional key associated with afunctional operation (e.g., save, select, open, etc.). One or morefunctional keys can also be manipulated to place the device in adifferent operational mode, where functional keys (or directional keys)are mapped to different navigation or functional operations. Forexample, where only one functional mode exists, the * key can be used toenter and exit the functional mode.

In the implementation shown, the keypad 212 includes four directionalkeys and eight functional keys. Alternatively, other numbers ofdirectional keys can be used that map to different navigationalparameters in different dimensional systems (e.g., three dimensionalsystems with six directional keys). In one implementation, depressing adirectional key results in movement of the cursor in a step operation onthe user interface of the controlled device. The size of the step can bepre-defined. The user interface at the controlled device can includesnap-to-grid functionality that allows for stepping across thepresentation area of the user interface by the cursor. Other kinds ofsteps (e.g., to a next item in a list or next object on a page, etc.)can be used.

In some implementations, the keypad 212 includes eight functional keysthat are mapped to keys 115 as follows: orientation command-1, clientcommand-3, UI command-7, selection-5, application command-9, text inputcommand, start/stop-*, modifier command-0 and macro command-#. Othermappings are possible. In addition, customized mappings can beassociated with a given user or controlling device 210. For example, thecontrolling device 210 is a telephone, then caller identificationinformation can be used to determine a mapping for the functional keysthat are customized for a given user. Customization of directional keysis also possible.

The orientation command key 231 can be manipulated to request audiofeedback for a current location (e.g., hovering over a send button) or acurrent environment (e.g., currently in e-mail application). The clientcommand key 232 provides application-level controls such as “readcurrent element of text” (word, line, or entire document),non-sequential navigation, and the like. For example, the client commandkey 232 can allow a user to navigate from a list of e-mails to thecontent of the e-mail in a non-sequential manner (e.g., allowing a userto view content of email in a list of emails, rather than merelyscrolling to a next email in the list).

The UI command key 233 when manipulated provides user interface-levelcontrols such as moving between windows or launching a program menu. Thetext input key 234 when manipulated allows a user to enter text into anapplication and can leverage technologies (e.g., Short Message Service(SMS)) to transmit the text to the controlled device. In someimplementations, the text input key 234 toggles between on and offstates that allows other keys to be mapped to particular textualelements. When toggled on, manipulation of the mapped keys producestextual output at the controlled device. The controlled device caninclude text completion routines that facilitate the easy entry of textthrough the controlling device 110. For example, text completionroutines can provide an audible feedback of a selectable “completion” toa partial text entry. In a text mode, one of the other keys not mappedto textual elements can be used to select the “completion” to save userkey strokes.

Start/stop key 235 allows for the entry or exit from a functional mode.In some implementations only one functional mode is provided. In someimplementations, the start/stop key 235 is only used to exit from thefunctional mode. For example, where only one function mode exists,another functional key can be depressed to start the functional mode.Alternatively, the start/stop key 235 can be used to toggle throughdifferent functional modes of the controlling device 210.

A modifier command key 236 can be used to modify or overload thefunctionality of the keys 215 in the same manner as the “Shift,” “Ctrl”and “Alt” keys of a QWERTY keyboard. If the user presses the modifierkey 236 followed by one of the other keys 215, then the key will performa different function. Alternatively, if the user holds down the modifierkey 236 while pressing one of the other keys, then the pressed key willprovide a different function.

A macro command key 237 allows for the selection of one or morepre-defined macro routines. A library of routines can be stored inmemory of the controlled device or the controlling device 210. Afterselection of the macro command key, a sequence number or combination ofkey strokes can be used to select from different routines stored in thelibrary for transmission to the controlled device. For example, scriptscan be stored for logging in, navigating to, and opening an emailapplication, opening an input box, and selecting a first item in theinbox for review. Alternatively, the macro command key 237 can be usedto store a sequence of key strokes for later retrieval. In someimplementations, after initiation of the macro command key 237, a recordstart sequence is entered. Thereafter, a record sequence stop key can bedepressed to end the recording session. A prompt can be provided by theinput device to designate the sequence for storage in the library foreasy future retrieval.

Controlled Device Software Architecture

FIG. 3 is a block diagram of an exemplary software architecture 300 fora controlled device. Other software architectures are possible. Forexample, the various components of the architecture 300 do not have tobe separate processes but can be part of a single process that isperformed by the operating system 302 or a single application.

Input signals from a controlling device (e.g., DTMF tones) are receivedby a hardware interface installed in or coupled to the controlled device(e.g., a modem, etc.). An operating system 302 and/or other low levelsoftware (e.g., device drivers, kernel) establishes communication withthe modem hardware and provides a software interface between thehardware and a telephony server process 304. The telephony serverprocess 304 provides telephony data to client applications 308. In someimplementations, the telephony server process 304 maps the telephonydata into control commands that can be received and interpreted by oneor more client applications 308. In other implementations, the clientapplications 308 receive telephony data in standard or intermediateformat and perform their own mappings. Still in other implementations, aseparate remote control application communicates with the telephonyserver 304 and performs mappings of telephony data for use by clientapplications 308.

In some applications, an additional authentication process 306 isperformed before client applications 308 can be controlled. Theauthentication process 306 can include known authenticationtechnologies. For example, the authentication process 306 can includeAES encryption (e.g., 128-bit keys) and passwords. In someimplementations, user authentication data received from the controllingdevice can identify a user profile for use in controlling a userinterface or performing other tasks at a controlled device.

In some implementations, a Caller ID can be used to start a securityevent or sequence. The controlled device can use a Caller ID to identifyusers, then perform security actions based on the identity of the users.For example, only certain individuals may be allowed to control thecontrolled device, while other users are blocked.

An example of a client application 308 suitable for being remotelycontrolled is the VoiceOver® application provided with Apple Computer,Inc.'s Mac OS® X v10.4 (“Tiger”) operating system. VoiceOver® readsaloud the contents of files, including web pages, mail messages and wordprocessing files, provides a comprehensive audible description of auser's workspace and includes a rich set of keyboard commands that allowusers to navigate the Mac OS® X interface and interact with applicationand system controls. The audible descriptions can be converted tosignals suitable for transmission to the controlling device, where theycan be reconstructed and played for the user through an audio system(e.g., telephone receiver). Thus, a user who is controlling anapplication or navigating a user interface can be guided by the audibledescriptions without an visual aids. Thus, standard POTs telephones canbe used to control tasks on an electronic device, which is not possiblewith conventional remote control technologies that provide only visualfeedback, and therefore require a display screen at the controllingdevice.

Controlled System Hardware Architecture

FIG. 4 is a block diagram of an exemplary hardware architecture 400 fora controlled device. The architecture 400 includes one or moreprocessors 402 (e.g., IBM PowerPC®, Intel Pentium® 4, etc.), one or moredisplay devices 404 (e.g., CRT, LCD) or no display devices, one or moregraphics processing units (GPUs) 406 (e.g., NVIDIA Quadro FX 4500,GeForce 7800 GT, etc.), one or more network interfaces 408 (e.g.,Ethernet, FireWire, USB, etc.), one or more input devices 410 (e.g.,keyboard, mouse, etc.), and one or more computer-readable mediums 412(e.g. SDRAM, optical disks, hard disks, flash memory, L1 or L2 cache,etc.). These components exchange communications and data via one or morebuses 414 (e.g., EISA, PCI, PCI Express, etc.).

The term “computer-readable medium” refers to any medium thatparticipates in providing instructions to a processor 402 for execution,including without limitation, non-volatile media (e.g., optical ormagnetic disks), volatile media (e.g., memory) and transmission media.Transmission media includes, without limitation, coaxial cables, copperwire and fiber optics.

The computer-readable medium 412 further includes an operating system416 (e.g., Mac OS®, Windows®, Linux, etc.), a network communicationmodule 418, client applications 420, a telephony server process 422 anda feedback generator 424.

The operating system 416 can be multi-user, multiprocessing,multitasking, multithreading, real-time and the like. The operatingsystem 416 performs basic tasks, including but not limited to:recognizing input from input devices 410; sending output to displaydevices 404; keeping track of files and directories on computer-readablemediums 412 (e.g., memory or a storage device); controlling peripheraldevices and processors (e.g., disk drives, printers, GPUs, etc.); andmanaging traffic on the one or more buses 414. The networkcommunications module 418 includes various components for establishingand maintaining network connections (e.g., software for implementingcommunication protocols, such as TCP/IP, HTTP, Ethernet, etc.).

A client application 420 can be any application or operating system taskthat is capable of being remotely controlled (e.g., desktop, email,media player, browser, word processor, accessibility application,assistant or wizard application, etc.), or a single application orprocess that facilitates the remote control of other applications. Thetelephony server process 422 communicates with telephony hardware (e.g.,a modem) and in some implementations maps telephony data into controlcommands for client applications 420. The feedback generator 424provides feedback to the controlling device. For example, the feedbackgenerator 424 can perform text-to-speech conversion, as described withrespect to FIG. 1.

The invention and all of the functional operations described herein canbe implemented in digital electronic circuitry, or in computer hardware,firmware, software, or in combinations of them. The invention can beimplemented as a computer program product, i.e., a computer programtangibly embodied in an information carrier, e.g., in a machine-readablestorage device or in a propagated signal, for execution by, or tocontrol the operation of, data processing apparatus, e.g., aprogrammable processor, a computer, or multiple computers. A computerprogram can be written in any form of programming language, includingcompiled or interpreted languages, and it can be deployed in any form,including as a stand-alone program or as a module, component,subroutine, or other unit suitable for use in a computing environment. Acomputer program can be deployed to be executed on one computer or onmultiple computers at one site or distributed across multiple sites andinterconnected by a communication network.

Method steps of the invention can be performed by one or moreprogrammable processors executing a computer program to performfunctions of the invention by operating on input data and generatingoutput. Method steps can also be performed by, and apparatus of theinvention can be implemented as, special purpose logic circuitry, e.g.,an FPGA (field programmable gate array) or an ASIC (application-specificintegrated circuit) or other customized circuitry.

Processors suitable for the execution of a computer program include, byway of example, both general and special purpose microprocessors, andany one or more processors of any kind of digital computer. Generally, aprocessor will receive instructions and data from a read-only memory ora random access memory or both. The essential elements of a computer area processor for executing instructions and one or more memory devicesfor storing instructions and data. Generally, a computer will alsoinclude, or be operatively coupled to receive data from or transfer datato, or both, one or more mass storage devices for storing data, e.g.,magnetic, magneto-optical disks, or optical disks. Information carrierssuitable for embodying computer program instructions and data includeall forms of non-volatile memory, including by way of examplesemiconductor memory devices, e.g., EPROM, EEPROM, and flash memorydevices; magnetic disks, e.g., internal hard disks or removable disks;magneto-optical disks; and CD-ROM and DVD-ROM disks. The processor andthe memory can be supplemented by, or incorporated in special purposelogic circuitry.

To provide for interaction with a user, the invention can be implementedon a device having a display, e.g., a CRT (cathode ray tube) or LCD(liquid crystal display) monitor, for displaying information to the userand an input device, e.g., a keyboard, a mouse, a trackball, and thelike by which the user can provide input to the computer. Other kinds ofdevices can be used to provide for interaction with a user as well; forexample, feedback provided to the user can be any form of sensoryfeedback, e.g., visual feedback, auditory feedback, or tactile feedback;and input from the user can be received in any form, including acoustic,speech, or tactile input.

The invention can be implemented in, e.g., a computing system, ahandheld device, a telephone, a consumer appliance, or any otherprocessor-based device. A computing system implementation can include aback-end component, e.g., as a data server, or that includes amiddleware component, e.g., an application server, or that includes afront-end component, e.g., a client computer having a graphical userinterface or a Web browser through which a user can interact with animplementation of the invention, or any combination of such back-end,middleware, or front-end components. The components of the system can beinterconnected by any form or medium of digital data communication,e.g., a communication network. Examples of communication networksinclude a local area network (“LAN”) and a wide area network (“WAN”),e.g., the Internet.

A number of implementations have been described. Nevertheless, it willbe understood that various modifications may be made. For example, andas noted above, reference is made to a client-server telephony processin a controlled device. Other configurations are possible, includingthose with direct control of a controlled device by a controllingdevice. The user interface can include navigation objects such asgraphical buttons, fields, text boxes, text, application shortcuts,files, windows, menus, and the like. The user interface can map thenavigation objects to a dimensional space (e.g., in a 2-D spatialrelationship). In response to keypad input, the controlled device cansend spatial directions to the user interface (e.g., to control movementof a cursor in response to a directional key), or commands to movedirectly to a particular object (e.g., in response to a functional key).In response to a selection, the user interface can initiate actions suchas launching an application or traversing a file structure. In oneimplementation, the controlled device leverages an existing hierarchywithin an application when navigating. For example, a table embedded ina document can be navigated as an object rather than cell by cell. Inresponse to a user command that selects the object, navigation canproceed at the table-level between cells.

Though not discussed above, one of ordinary skill in the art willrecognize that an initial orientation may need to be required tofacilitate control of the controlled device. For example, initialorientation can be controlled by either an announcement to the userprior to an initial interaction or a pre-defined placement of the cursoror state of the device.

1. A method comprising: receiving an input from a telephony device;using the input to control a software user interface associated with acontrolled device, the software user interface having a plurality ofuser interface objects, wherein controlling the software user interface,comprises: directionally stepping through the plurality of userinterface objects displayed in the software user interface to select auser interface object of interest; activating the user interface objectof interest in the software user interface; and producing an outputassociated with the activation of the user interface object of interest;where a function mapped to an input provided at the telephony deviceinitiates gathering information to determine a state of the controlleddevice at the telephony device.
 2. The method of claim 1, where theinput is a tone.
 3. The method of claim 1, where the input is a voicecommand.
 4. The method of claim 1, where the telephony device is atelephone.
 5. The method of claim 1, where the software user interfaceis a graphical user interface.
 6. The method of claim 1, where theoutput includes audio signals of speech descriptions.
 7. The method ofclaim 1, where the output produces audio feedback in the telephonydevice.
 8. The method of claim 1, further comprising receiving userauthentication data from the telephony device.
 9. The method of claim 8,where the user authentication data identifies a custom user profile foruse in manipulating the software user interface.
 10. The method of claim1, where the input generates control commands for maneuvering through ahierarchical structure of user interface objects within applicationsassociated with the controlled device.
 11. The method of claim 1, wherea function mapped to an input at the telephony device provides access touser interface objects within an application associated with thecontrolled device.
 12. The method of claim 1, where a function mapped toan input provided at the telephony device provides access to a desktopfeature from a plurality of desktop features consisting of: searchingfor files or information, navigating the desktop or other userinterface, launching an application or operating system task, orselecting an item, icon or file.
 13. The method of claim 1, where afunction mapped to an input provided at the telephony device providesfor execution of an application-specific or client-specific command. 14.The method of claim 1, where a function mapped to an input provided atthe telephony device initiates a text-input mode.
 15. The method ofclaim 1, where the information is an audio signal of speechdescriptions.
 16. The method of claim 1, where the state is related tothe selected user interface object of interest.
 17. A computer-readablemedium having instructions stored thereon, which, when executed by oneor more processors, causes the one or more processors to performoperations comprising: receiving an input from a telephony device; usingthe input to control a software user interface associated with acontrolled device, the software user interface having a plurality ofsoftware display objects, wherein controlling the software userinterface, comprises: directionally stepping through the plurality ofsoftware display objects in the software user interface to select asoftware display object of interest; activating the software displayobject of interest in the software user interface; and producing anoutput associated with the activation of the software display object ofinterest in the software user interface; where a function mapped to aninput provided at the telephony device initiates gathering informationto determine a state of the controlled device at the telephony device.18. The computer-readable medium of claim 17, where the input is a tone.19. The computer-readable medium of claim 17, where the input is a voicecommand.
 20. The computer-readable medium of claim 17, where thetelephony device is a telephone.
 21. The computer-readable medium ofclaim 17, where the software user interface is a graphical userinterface.
 22. The computer-readable medium of claim 17, where theoutput includes audio signals of speech descriptions.
 23. Thecomputer-readable medium of claim 17, where the output produces audiofeedback in the telephony device.
 24. The computer-readable medium ofclaim 17, further comprising receiving user authentication data from thetelephony device.
 25. The computer-readable medium of claim 24, wherethe user authentication data identifies a custom user profile for use inmanipulating the software user interface.
 26. The computer-readablemedium of claim 17, where the input generates control commands formaneuvering through a hierarchical structure of software display objectswithin applications associated with the controlled device.
 27. Thecomputer-readable medium of claim 17, where a function mapped to aninput at the telephony device provides access to software displayobjects within an application associated with the controlled device. 28.The computer-readable medium of claim 17, where a function mapped to aninput provided at the telephony device provides access to a desktopfeature from a plurality of desktop features consisting of: searchingfor files or information, navigating the desktop or other userinterface, launching an application or operating system task, orselecting an item, icon or file.
 29. The computer-readable medium ofclaim 17, where a function mapped to an input provided at the telephonydevice provides for execution of an application-specific orclient-specific command.
 30. The computer-readable medium of claim 17,where a function mapped to an input provided at the telephony deviceinitiates a text-input mode.